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Nonparametric statistical methods 

Parametric statistical methods are implemented based on definite assumptions. For 
example, an independent f-test requires conditions of normal distributions for each 
group. The normality of the population distribution from which the sample data was 
originated may be assessed by examining graphs, by formal normality test such as the 
Shapiro-Wilk test or Kolmogorov-Smirnov test, or by assessment using skewness and 
kurtosis, as discussed in a previous session. When a substantial departure is found 
in data with continuous outcomes, we may consider transformation of data, e.g., 
by taking logarithms. If transformed data satisfy the assumptions, usual parametric 
methods, such as f-test, ANOVA, or linear regression may be applied. However if 
transformed data do not satisfy the assumptions, we need to choose nonparametric 
methods. 

Nonparametric methods are called as 'distribution-free tests' because generally they 
don't require any assumptions about underlying population distribution. Nonparametric 
methods may be applied when the data do not satisfy the distributional requirements 
of parametric methods. In nonparametric methods, ranks of observations are used 
instead of the measurements themselves, which may cause somewhat loss of 
information. While parametric methods estimate the key parameters of the distribution 
such as population means, nonparametric methods mainly test the pre-set hypothesis, 
e.g., whether two data differ or not. As the result, nonparametric methods usually 
don't provide any useful parameter estimates. 

Nonparametric methods may be applied on a wide range of data which is not suitable 
for the parametric analysis. Severely skewed continuous data or continuous data with 
definite outliers may be analyzed using nonparametric methods. Small-sized samples 
which don't guarantee the exact normality need nonparametric methods. Also data with 
ordinal outcomes are often analyzed using nonparametric methods. 

Advantages of nonparametric methods are as follows: 

1) Wider range of data are applicable because of no or limited assumptions. 

2) Useful in analyzing ordered data or categorical data 

Disadvantages of nonparametric methods are as follows: 

1) Applying nonparametric methods on data satisfying assumptions of parametric 
methods may result in less power. However loss of power may be little or none for 
analysis of data which cannot satisfy the assumptions. 

2) Tied values can be problematic and appropriate adjustment may be required. 

3) Limited flexibility in generalizing to more complex analytic methods compared to 
parametric methods. 
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Nonparametric methods for comparing two groups 

Table 1 shows parametric methods and nonparametric methods for comparing independent two groups and correlated two 
groups. The Wincoxon rank-sum test (or the Mann-Whitney U test) is comparable to the independent f-test as the Wilcoxon 
signed-rank test is to the paired f-test. 



Table 1. Parametric methods and nonparametric methods for comparing two or more groups 



Comparison 


Parametric method 
(normal distribution assumed) 


Nonparametric method 
(distribution-free method) 


Independent two groups 


Independent t-test 


Wilcoxon rank-sum test 
(or Mann-Whitney U test) 


Correlated two groups 


Paired t-test 


Wilcoxon signed-rank test 


Independent three or more groups 


One-way ANOVA 


Kruskal-Wallis test 


Correlated three or more groups 


Repeated measures one-way ANOVA 


Friedman test 



1. Wilcoxon rank-sum test (or Mann-Whitney U test) 

The Wilcoxon rank-sum test (or the Mann-Whitney U test) is applied to the comparison of two independent data whose 
measurements are at least ordinal. The null hypothesis is that two sets of scores are samples from the same population; 
therefore they do not differ systematically. Steps of the test are as follows: 

1 st step: Transformation of observed scores into ranks 





Observed score 






Rank 




Groups 


A B 


Groups 


A 




B 



11 2 7 2 

14 9 1=^ ^ 8 6 
Data Data 

7 0 4 1 

8 5 5 3 

Sum of ranks 24 12 



2 n step: W (Difference from the rank sum of A group to the rank sum of B group) = 24 - 12 = 12 
3 rd step: Calculation of the probability of the observed case or more extreme cases 

a) Number of all possible cases when 4 observations were selected from the set with 8 observations: 

8! / (4! * 4!) = 70 

b) Number of cases with larger difference of rank sum from A group to B group = 4 cases 
Rank combinations of A group with (5, 6, 7, 8), (4, 6, 7, 8), (3, 6, 7, 8), and (4, 5, 7, 8). 

4 th step: Calculation of probability 

a) p value for one-sided test = 4 / 70 = 0.057 

b) p value for two-sided test = 2 * (4 / 70) = 0.114 (exact p value) 

If sample size is large (> 20), asymptotic p value (below e) is used for the statistical determination. 
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The Wilcoxon rank-sum test using the SPSS statistical package is according to following procedures: 



(a) data 



H Untitled - SPSS Data Editor 
File Edit View Data Transform Analyze Grap 





Group 


Value 


vac 


1 


1 


11 




2 


1 


14 




3 


1 


7 




4 


1 


8 




5 


2 


2 




6 


2 


9 




7 


2 


0 




8 


2 


5 













(b) Analyze - Nonparametric tests 
- 2 Independent samples 





Reports ► 
Descriptive Statistics ► 
Tables ► 
Compare Means ► 
General Linear Model ► 
Mixed Models ► 
Correlate ► 
Regression ► 
Loglinear ► 
Classify ► 
Data Reduction ► 
Scale ► 








var 


var 






























































































Nonparametric Tests ► 




Chi-Square... 

Binomial... 

Runs... 

1-Sample K-S... 




Time Series ► 
Survival ► 
Multiple Response ► 




Missing Value Analysis... 


2 Independent Samples... 










K Independent Samples... 
2 Related Samples... 
K Related Samples... 





















(c) Define test variable and grouping variable 

±3 Two-Independent-Samples Tests 



Test Variable List: 



Value 



( . Grouping Variable: 

rn iGroupd 2) 

Define Groups,,. | 



Reset | 
Cancel | 
Help | 



Test Type 

fy Mann-Whitney U 

r Moses extreme reactions 



r Kolmogorov-Smirnov Z 
r Wald-Wolfowitz runs 



Exact... | Options... | 



(d) Sum of ranks 



(e) Wilcoxon W and p values 



Ranks 



Group 


N 


Mean Rank 


Sum of 
Ranks 


Value 1 


4 


6.00 


24.00 


2 


4 


3.00 


12.00 


Total 


8 







Test Statistics 6 





Value 


Mann-Whitney U 
Wilcoxon W 
Z 

Asymp. Sig. (2-tailed) 
Exact Sig. [2*(1 -tailed 
Sig.)] 


2.000 
12.000 
-1.732 
.083 

.114 a 



a. Not corrected for ties. 

b. Grouping Variable: Group 



2. Wilcoxon signed-rank test 

The Wilcoxon signed-rank test is applied to the comparison of two repeated or correlated data whose measurements are at 
least ordinal. The null hypothesis is that there is no difference in two paired scores. Steps of the test are as follows: 
1 st step: Compute difference scores between two paired values 
2 nd step: Assign ranks from the smallest absolute difference to the largest one 

3 rd step: Provide signs, + or -, according to the direction of the differences and compute sum of positive and negative 
ranks, respectively. 

4 th step: Set T = Smaller absolute value of positive or negative rank sum. T = 4 (negative rank sum) 
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An example of calculating sum of positive and negative ranks is shown in the Table 2. 



Table 2. Procedures from 1 st step to 3 rd step of the Wilcoxon signed-rank test 



No 


Before 


After 


Difference 


Ranks 


Positive 


Negative 


1 


36 


21 


15 


11 


11 




2 


23 


24 


-1 


-1 




1 


3 


48 


36 


12 


10 


10 




4 


54 


30 


24 


12 


12 




5 


40 


32 


8 


7 


7 




6 


32 


35 


-3 


-3 




3 


7 


50 


43 


7 


6 


6 




8 


44 


40 


4 


4 


4 




9 


36 


30 


6 


5 


5 




10 


29 


27 


2 


2 


2 




11 


33 


22 


11 


9 


9 




12 


45 


36 


9 


8 


8 




Sum 










74 


4(= T) 



5* step: Wilcoxon signed rank test 

a) Small sample size (N < 20): find the p value for T value and sample size n in the table for the Wilcoxon signed- 
rank test. 

For this example, p value = 0.0017 for T = 4 when sample size = 12. 

b) Larger ample size (N > 20): use z distribution approximation. 

. N(N-l) 



l N(N+ 1)(2/V+ 1) 
I 24 

Statistical decision: reject null hypothesis if |z| > 1.96 at the significance level a = 0.05. If the sample size was 
larger than 20, the asymptotic p value 0.006 (below e) could be used. 
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The Wilcoxon signed-rank test using the SPSS statistical package is according to following procedures: 



(a) data 



j Untitled - SPSS Data Editor 

le Edit View Data Transform Analyze Grapr 



g|H|g| jg| q|H m&i ftl j 



21: 



T 





Pre_score 


Post_score 


var 


1 


36 


21 




2 


23 


24 




3 


48 


36 




4 


54 


30 




5 


40 


32 




6 


32 


35 




7 


50 


43 




8 


44 


40 




9 


36 


30 




10 


29 


27 




11 


33 


22 




12 


45 


36 













(b) Analyze - Nonparametric tests 
- 2 Related samples 



[ Analyze ) Graphs Utilities Window Help 



(c) Define test pair list 





Reports ► 
Descriptive Statistics ► 
Tables ► 
Compare Means ► 
General Linear Model ► 
Mixed Models ► 
Correlate ► 
Regression ► 
Loglinear ► 
Classify ► 
Data Reduction ► 
Scale ► 








var 


var 




























































































Nonparametric Tests ► 




Chi-Square... 

Binomial... 

Runs... 

1-Sample K-S... 

2 Independent Samples... 

K Independent Samples... 




Time Series ► 
Survival ► 
Multiple Response ► 
Missing Value Analysis... 














2 Related Samples... 








K Related Samples... 



* 3 Two-Related-Samples T 



Current Selections 
Variable I: Post_score 
Variable 2: 



Test Pair(s) List: 
Pre.score -- Pos 



Test Type 



| OK 



(7 Wilcoxon r Sign r McNemar 
r Marginal Homogeneity 



Exact... | Options... 



(d) Mean ranks (positive, negative) 



(e) Z-value and p values 



Ranks 





N 


Mean Rank 


Sum of 
Ranks 


PosLscore - Pre.score Negative Ranks 


10 a 


7,40 


74.00 


Positive Ranks 


2* 


2,00 


4,00 


Ties 


0° 






Total 


12 







a. PosLscore < Pre.score 

b. Post.score > Pre.score 

c. Post.score = Pre.score 



Test Statistics 6 





PosLscore - 
Pre.score 


1 

Asymp. Sig. (2-tailed) 


-2.746 a 
.006 



a. Based on positive ranks. 

b. Wilcoxon Signed Ranks Test 
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